A Survey on Preprocessing Techniques in Web Usage Mining
نویسنده
چکیده
The World Wide Web (WWW) continues to grow at an overwhelming rate in both the sheer volume of traffic and the size and complexity of Web sites. Therefore, it becomes more and more necessary, but difficult to get useful information from Web data, in order to understand and better serve the needs of Web-based applications. As a result, the Web usage mining has become a hot research topic, which combines two of the prominent research areas comprising the data mining and the World Wide Web. Web usage mining consists of three phases, namely preprocessing, pattern discovery, and pattern analysis. For the survey, I focus on the first phase—data preprocessing, which is essential and must be performed prior to applying data mining algorithms to the data sources. An overview of data preprocessing techniques aiming at identifying unique users, user sessions and transactions is presented in this survey.
منابع مشابه
A Survey on Preprocessing Methods for Web Usage Data
World Wide Web is a huge repository of web pages and links. It provides abundance of information for the Internet users. The growth of web is tremendous as approximately one million pages are added daily. Users’ accesses are recorded in web logs. Because of the tremendous usage of web, the web log files are growing at a faster rate and the size is becoming huge. Web data mining is the applicati...
متن کاملWeb Usage Mining Tools & Techniques: A Survey
--The Quest for knowledge has led to new discoveries and invention. That leads to amelioration of various technologies. As years passed World Wide Web became overloaded with information and it became hard to retrieve data according to the need .Web mining came as a violence to provide solution of above problem. Web usage mining is category of web mining. Web usage mining mainly circulation with...
متن کاملAn Algorithmic Approach to Data Preprocessing in Web Usage Mining
Web usage Mining is an area of web mining which deals with the extraction of interesting knowledge from logging information produced by web server. Different data mining techniques can be applied on web usage data to extract user access patterns and this knowledge can be used in variety of applications such as system improvement, web site modification, business intelligence etc. Web usage minin...
متن کاملA Survey of Preprocessing Method for Web Usage Mining Process
The amount of web applications are increasing in large amount and users of web applications are also increasing rapidly with high speed. By increasing number of users the size of log file also increases .The information which stores in log files cannot be directly used for analysis. Therefore preprocessing of log files is necessary to improve the quality of web usage mining process. Preprocessi...
متن کاملSessionization –A Vital Stage in Data Preprocessing of Web Usage Mining-A Survey
The World Wide Web has impacted on almost ever aspects of our lives in modern era. The Web has many unique characteristics and which make mining useful information and knowledge a challenging task. Web mining uses many data mining techniques but it is not an application of traditional data mining due to heterogeneity and unstructured nature of the data on Web. Web mining tasks can be categorize...
متن کامل